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摘要 : MicroRNA (miRNA) 是 一 类 广泛 存在 于 真 核 生物 中 的 具有 转录 后 水 平 调控 功能 的 内 源 非 编码 小 分 
FRNA, 在 植物 中 ，miRNA 通过 对 靶 基 因 的 剪 切 或 沉默 来 实现 对 植物 生命 活动 的 调控 ， 它 是 基因 表达 调 
控 网 络 的 重要 组 成 部 分 。miR165/166 (miR166) 是 陆 生 植物 中 最 为 古老 的 MIRNA 家 族 之 一 ， 它 通过 对 3 
型 同 源 异 域 型 - 亮 氨 酸 拉链 (HD-ZIP H) 等 靶 标 的 调控 ， 在 植物 的 众多 发 育 时 期 起 着 关键 的 调控 作用 。 
本 文 分 析 了 MIR166 基因 在 陆 生 植物 中 的 进化 关系 ， 并 对 MIRIO6 在 基部 陆 生 植 物 小 立 碗 玉 (Physcomitrella 
patens) 中 的 复制 及 进化 进行 了 研究 。 此 外 ，HD-ZIP II 蛋白 是 植物 中 重要 的 一 类 转录 因子 ，miR166 对 
HD-ZIP 克基 因 的 调控 作用 在 陆地 植物 保守 的 存在 ， 本 文 对 HD-ZIP I 基因 和 miR166 在 进化 中 的 相互 作 
用 进行 了 初步 的 探讨 。 
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Evolution of MIR166 Gene Family in Land Plants 
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Abstract: MicroRNA (miRNA) is a class of endogenous non-coding small RNAs with important post-transcriptional 
regulatory roles in eukaryotes. Plant miRNAs play important roles in the post-transcriptional regulatory network through 
mediating cleavage or silencing of target mRNAs. As one of the ancient MIRNA family, miR165/166 (miR166) is a 
key regulator in land plants. In this study, we analyzed the diversity and molecular evolution of M/R166 genes in land 
plants, and identified the replication and evolution of MIR166 genes in Physcomitrella patens. Homeodomain Leucine- 
zipper of class II ( HD-ZIP III) proteins are important transcription factors in plants, HD-ZIP IIT genes are con- 
served targeting of miR166 in angiosperms, gymnosperms, ferns, and mosses. Here, a preliminary study was con- 
ducted to make clear the relationships between HD-ZIP III genes and miR166 during the evolutionary process. 
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After the first miRNA gene (lin-4) was identi- more and more miRNAs were discovered in eu- 


fied in Caenorhabditis elegans ( Lee et al., 1993) , karyotes through deep sequencing and bioinformatics 
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approaches. Those MIRNA genes with significant se- 
quence homology to each other when compared 
based on mature miRNA sequences are grouped into 
miRNA families (Ambros et al., 2003). Hundreds 
of miRNA families have been found in plants, but 
most of them are lineage-specific, suggesting that 
most known MIRNA genes arose relatively recently in 
evolutionary time scale ( Cuperus et al., 2011). 
There are only 8 families are deep conserved in land 
plants, miR166 family is one of this kind ( Nozawa et 
al., 2010). 

miRNAs play important roles in various devel- 
opmental and physiological processes (Seed germi- 
nation, vegetative growing, flowering, seeding and 
resistance to various abiotic or biotic stress) by tar- 
geting mRNAs for cleavage or translational repres- 
sion at the post-transcriptional level in the plants 
kingdoms (Reinhart et al., 2002; Carrington and 
Ambros, 2003). miR166 and its targets regulate an 
array of plant developmental processes, including 
shoot apical and lateral meristem formation, leaf po- 
larity, floral development, and vascular development 
(McConnell et al., 2001; Otsuga et al., 2001; 
Prigge et al., 2005; Jung and Park, 2007). 

An increasing number of plant miRNA genes 
have been discovered through experimental and 
bioinformatic studies in plants, and they provide an 
opportunity to study the origins and evolution of 
them. There have been some evolutionary research 
on miRNAs in animals, plants and some smaller lin- 
eage (Li et al., 2011; Hertel et al., 2012; Zhao et 
al., 2012). The miRNAs in land plants are regarded 
as deriving from a common ancestor ( Jones-Rhoad- 
es, 2012). However, it is unclear whether the miR- 
NA pathway in green alga shares a common ancestry 
with miRNA pathways of land plants, or whether it 
is independently derived. There have not found a 
miRNA conserved between land plants and algas 
( Jones-Rhoades, 2012). 

The previous research had revealed the evolu- 
tionary history of the MIR166 homologs in Arabidop- 


sis derived from duplication events, including tan- 


dem duplications and segmental duplication events 
(Maher et al., 2006). Then, Sun et al. (2012) 
characterized the MIR166 genes derived from duplica- 
tion events in rice, sorghum, and poplar. Those re- 
searches indicating that duplication events are impor- 
tant in the expansion of MIRI66 genes in flowering 
plants. In this study, we found the phenomenon is 
exist in moss too, then, the overall phylogenetic tree 
of miR166 in Physcomitrella patens was reconstruc- 
ted base on the duplication events. 

In addition to the duplication and loss events in 
the genome, the targets of miRNA is another impor- 
tant factor in the miRNA evolution. The “new” miR- 
NAs only have a few suspected targets while the con- 
served miRNAs target numerous important mRNAs 
in the eukaryotes. In the evolutionary history of miR- 
NAs, the “ancestor” miRNAs were subdivided into 
hundreds of families, and the miRNAs which cap- 
ture important targets finally evolve to the conserved 
multi-gene families ( Bompfunewerer et al., 2005; 
Hertel et al., 2012). The mature miRNA sequence 
is complementary to the mRNA targeted region, thus 
the conserved target determine the conservative pro- 
perty of miRNA. The gain of target mRNA can cause 
the expansione or functional diversification of miR- 
NA, and the loss and pseudogene change of mRNA 
can cause deletion or functional loss of miRNA. 

In this study, we reconstructed the phylogenetic 
tree of MIR166 genes in land plants, then, we iden- 
tified the duplication events originated miR166 genes 
in Physcomitrella patens and inferred their phyloge- 
netic relationship. At last, the phylogenetic distribu- 
tion of miR166 targeted- HD-ZIP II in land plants 
were constructed to study the interaction between 


miR166 and HD-ZIP III. 


1 Materials and methods 
1.1 Phylogenetic analysis of MIR166 genes in 
land plants 

The mature sequences and precursor sequences 
of miRNAs were downloaded from miRBase database 


release 19 (http: // www. mirbase. org/) ( Addition- 
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al file 1, http; //journal. kib. ac. cn/UserFiles/File/ 
ZXY.rar). The LOGO representation of mature miR- 
NAs was obtained with the WebLogo software 
( Crooks et al., 2004). 

All of the stem-loop sequences of the miR166 
family were aligned using Clustal W (Larkin et al., 
2007). Then, We excluded the incorrect aligned se- 
quences in the conserved regions of miRNA; miR- 
NA” and aligned the remaining sequences again. 
Secondary structures were produced for each se- 
quence in the alignment using RNAfold (Mohsen et 
al., 2009). Then, we used 4SALE to correct the a- 
lignment manually by considering the agreement in 
secondary structure (Seibel et al., 2006) and the 
consensus sequences and structure using RNAalifold 
( Bernhart et al., 2008). 

Neighbor-joining (NJ) phylogenetic trees based 
on the p-distance and Kimura 2-parameter distance 
were generated by MEGA version 5.0 (Tamura et al., 
2011). Bootstrap confidence values were obtained ap- 
plying by 1 000 replications, and only the clades 
with bootstrap value higher than 50 were shown. 
1.2 Identification of miR166 homologues resides 
in duplicated blocks in Physcomitrella patens 

In order to determine whether the miRNAs a- 
rose or evolved from segmental duplication events, 
genome-wide analysis was undertaken to examine 
whether a miRNA resides within a duplicated block 
as previously described (Zhang et al., 2009; Sun et 
al., 2012). First, we used NCBI map viewer (http: 
//www. ncbi. nlm. nih. gov/projects/mapview/ ) to ex- 
tract 10 protein-coding genes in flanking regions for 
each MIR166 homologues or tandemly duplicated 
miR166 clusters. Then the flanking protein-coding 
genes of every miRNA were aligned, using standa- 
lone BLAST (blastn, version 2. 2. 27) , against those 
protein-coding genes surrounding another miRNA in 
order to identify paralogs. For each miRNA pair, the 
number of protein coding genes with the best non-self 
match to protein-coding genes flanking another miR- 
NA was counted (Sun et al., 2012) ( Additional file 
2, http; //journal. kib. ac. en/UserFiles/File/ZXY. 


rar). The flanking noncoding sequences were aligned 
by Emboss to help resolve the evolutionary history of 
the miR166 family (Rice et al., 2000). 
1.3 miR166 targeted-HD-Zip III in land plants 

A total of 60 mRNA sequences of HD-Zip III 
protein were downloaded from NCBI. Then 36 se- 
quences from 9 plant species (Arabidopsis thaliana , 
Medicago truncatula, Oryza sativa, Pinus taeda, 
Populus trichocarpa, Ricinus communis, Selaginella 
moellendorffii, Solanum lycopersicum, and Sorghum 
bicolor) were collected for target prediction ( Addi- 
tional file 3, http; //journal. kib. ac. cn/UserFiles/ 
File/ZXY. rar). The miR166-targeted HD-Zip II 
mRNAs were predicted in psRNATarget under the 
default rules (Dai and Zhao, 2011). 
1.4 Phylogenetic analysis of HD-Zip IH tran- 
scription factor 

Sequences of HD-Zip II mRNA were aligned 
using CLUSTALX, then MEGA version 5.0 was 
used to construct neighbor-joining ( NJ) and maxi- 
mum likelihood ( ML) tree. Supporting values were 


assessed using 1000 replicate bootstrap tests. 


2 Results 
2.1 MIR166 is a diversity ancient miRNA gene 
family that is ubiquitously distributed in flower- 
ing plants, gymnosperms, ferns, and mosses 

So far, miRBase release 19 has collected a total 
of 209 MIR166 genes of 35 species ranged from moss 
to the higher plants ( Table 1). The precursor se- 
quences of miR166 are diversified in land plants, 
the longest precursor is 670nt and the shortest one is 
72nt ( pvu-miR166 and sbi-miR166b ) However, 
the short functional mature sequence is highly con- 
served in land plants. We also found that the mature 
miRNA were sequence characterized in the different 
lineages ( Figure 1). MIRI66 is a typical multiple- 
gene family, and there are numerous paralogs in 
many plants species ( miR166 is single copy in 7 
plants, but it’s attribute to the lack of genome and 
deeper research ). Evidences above indicate that 


MIR166 is an old miRNA family in land plants. The 
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features of miR166 precursors suggest that the 
MIR166 genes have a complex evolution history. 


Table 1 The distribution of miR166 in land plants 














Number of 
Clade Species miR166 
genes 

Bryophyta Physcomitrella patens 13(13,0) 
Fern Selaginella moellendorffii 3(1,2) 
g g Picea abies 2(2,0) 
= 2 Pinus densata 2(2,0) 
oS Pinus taeda 2(2,0) 
Brachypodium distachyon 7(5,2) 
~ Hordeum vulgare 3(2,1) 

8 Oryza sativa 14(8,6) 

S Sorghum bicolor 11(8,3) 

Zea mays 14(12,2) 
Arabidopsis lyrata 9(4,5) 
Aquilegia coerulea 5(3,2) 
Arabidopsis thaliana 9(4,5) 
Brassica napus 6(4,2) 
Cucumis melo 9(6,3) 
£ Citrus reticulata 2(2,0) 
= Citrus sinensis 5(3,1)* 
È g Citrus trifoliata 1(0,0)* 
ae Digitalis purpurea 2(1,1) 
= Gossypium hirsutum 1(1,0) 

2 Glycine max 21(6,15) 
多 2 Hevea brasiliensis 2(2,0) 

S 

Q Helianthus paradoxus 1(1,0) 
S Helianthus petiolaris 1(0,1) 
Malus domestica 9(4,5) 
Manihot esculenta 1(0,1) 
Medicago truncatula 8(6,2) 
Nicotiana tabacum 8(3,5) 

Populus trichocarpa 17(9,8) 

Phaseolus vulgaris 1(0,0)* 
Ricinus communis 5(4,1) 
Solanum lycopersicum 2(2,0) 
Salvia sclarea 1(1,0) 
Theobroma cacao 4(4,0) 
Vitis vinifera 8(2,6) 


The number in the bracket is type 1 and type 2 miR166 gene in this 


species, “ represent the miRNA genes with 5’ mature miRNAs 


2.2 Evolution of MIR166 gene in land plants 
Stem-loop sequences were usually applied to 
study the phylogenetic relationship because they are 


the most conserved parts of the miRNA genes (Li et 
al., 2011; Hertel et al., 2012; Zhao et al., 2012). 


The selections for mutations in structured RNA 
mainly arise from base-pairings to maintain the RNA 
structures while in protein-coding sequences selec- 
tion pressure disadvantages mutations in triplet codes 
that disrupt protein functions through alteration of a- 
mino-acids (Li et al., 2011). As a class of func- 
tional RNA, structure character of pre-miRNA is 
more important than its sequence character out of the 
region of miRNA; miRNA“ , just as nonsense muta- 
tion in the protein, the substitution don’t cause 
structural change in the stem-loop is non-significa- 
tion mutation. So we construct the phylogenetic tree 
of MIR166 using the structure and sequence charac- 
ters of the stem-loop. 

We aligned the stem-loop sequences of miR166, 
three precursors with 5’ mature miRNA were exclu- 
ded ( Ambros et al. , 2003). Then we used RNAfold 
to predicted their stem-loop structures, in this step, 
we classified the pre-miR166 into two types accord- 
ing to their structural conservation in the region of 
loop (Figure 2), type 1 pre-miR166 which have a 
long stem with several mismatch bases and a loop is 
canonical stem-loop structure, and there are extra 
loops in type 2 pre-miR166 cause by the abundant 
mismatch in this region which may be induced by 
nucleotide variation in the evolution history. Type 1 
is the main form of pre-miR166 in land plants, all 
the pre-miR166 in the base of land plants Physcomi- 
trella patens are belong to type 1, and type 2 pre- 
miR166 is a variation of type 1. 

So we determined to use the 127 type 1 pre-miR166 
to build the phylogenetic tree in land plants, the stem- 
loop are divided into three space partitions in the 
structure; miRNA; MIRNA” duplexes, prolonged 
stem, and loop. The most conserved miRNA; MIR- 
NA” duplexes region were retain, in the prolonged 
stems and loops which is cause by base pairing and 
base mismatch respectively , the sequence characters 
of AUCG were translated into structure characters. 
Then, we used RNAalifold to identify the consensus 
sequences and structures of prolonged stems and 


loops of pre-mirl66 (Figure 3). At last, the structure 
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Fig.2 Structure of pre-miRNA 
The stem-loop structure of pre-miRNA is divided into three parts, the 
part in green wireframe is miRNA; miRNA * duplex, the part in pur- 
ple wireframe is prolonged stem, the part in red wireframe is loop. The 
sequences of red is mature miRNA. Type | premiRNA in the left have 
a typical stem-loop secondary structure, and the premiRNA with a 


complicated loop in the right is belongs to type 2 


characters of prolonged stems and loops were transla- 
ted into sequence characters based on the consensus 
sequences and structures again ( Additional file 4, 
http: //journal. kib. ac. cn/UserFiles/File/ZXY. rar) , 
the sequences of pre-miRNAs were aligned by Clust- 
al W and then the NJ tree were constructed by 
MEGA 5.0 (Figure 4). 

In the tree, the homologous genes are highly 
conserved in moss, most of MIRI66 genes show a 
clustered tendency in monocots, they also have a 
closer neighbor relationship to the moss. However, 
their homologies in dicots showed a more diversified 
evolutionary relationship. We speculate that the 
MIRI66 gene was originated in the original group 
land plant just as moss. Then, the original seeding 
plant inherited the gene and then hand down it to the 
original Gymnosperm and flowering plants in the o- 
rigination and evolution of flowering and seeding 
plants, the ancestor of monocots should be appeared 
before the dicots even the gymnosperm. 

2.3 Evolutionary conservation analysis of miR166 
homologues derived from duplication events in 
P. patens 

Gene duplications are derived from the duplica- 

tion events which including whole-genome duplica- 


tion (WGD) , segmental duplications, and local du- 
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miRl UGAGGGGGGAUGUUGUCUGUUUCGAGGUCAUUUAGAGAUUUGAUUCUUCUAAAUGGAGUGUAUUUUUAUUGAAUUCUAAAUGACCUCGGACCAGGCUUCAUUCCUCUCAGC 
mR2 UCUGCUGUGAGGAAUGCCCCCUGGCCCGAAGCCAUGUGCCUUUUCAUGUCCACGUGGGGUAAACAUUAAAGCUUCGGACCAGGCUUCAUUCCCCUCAGCACA 
mR3 UUUUCAUGGUUGUCGAGGGGAAUGACGCCGGGUCUGAAAGAAAGAAGGGGGCACGCCGGG AAUGACAGCGGUUCCCUAGCUCUUUCGG ACCAGGCUUCAUUCCCCAUGACUCGACCAUGG 


miRl GGUCAUUUAGAGAUUUGAUUCUUCUAAAUG GAGUGUAUUUUUAUUGAAUUCUAAAUG ACC 
idddddddadddeiddddd eid dd eee ))))......... )3))3))))))3)))))) 
miR2 AGCCAUGUGCCUUUUCAUGUCCACGUGGGGUAAACAUUAAAGCU 
C0. 00000000. C000. ))))))))..))))....))) 
miR3 AAGAAGGGGGCACGCCGGG AAUGACAGCGGUUCCCUAGCUCUU 
(CC DD))...)))) 
consensus sequences AGCC AUG UGGC AAUUCGAGUC------------ UC AUAGCGAAUACAUAAAAACU 


comwersus stuctue (CCC CCC CCCCC-C0CO0G. eee sssr eee eee es 2))))))))))...)))) 


mRl AGCCAUGUGGCAAUUCGAGUUCUUCUAAAUGGAGUGUAUUUUAGCGAAUACAUAAAUAACU 


ACCEL Dee )))))))))))))))))) 
miR2 AGCCAUGUGGCAAUUCAUGUCCACGUGGGGUAAACAUAAAAACU 


CAC- COCO. COCO. 99). 9). 
miR3 AGCCAGGGGGC ACGCCGGGAAUGAC AGGAAUACAUAAA AAACU 


(CCC aaa III II) 


mRl UGAGGGGGGAUGUUGUCUGUUUCGA AGCCAUGUGGC AAUUCGAGUUCUUC UAAAUGGAGUGUAUUUUAGCGAAUAC AUAAAUAACU UCGGACCAGGCUUCAUUCCUCUCAGC 
miR2 UCUGCUG UG AGG AAUGCCCCCUGGCCCGA AGCC AUGUGGCAAUUC AUG UCC ACGUGGGGUAAAC AUAAAAACU UCGGACCAGGCUUCAUUCCCCUCAGCACA 
mR3 UUUUCAUGGUUGUCG AGGGGAAUGACGCCGGGUCUG A AGCC AGGGG GCACGCCGGG AAUG AC AGGAAUAC AUAAAAAACU UCGG ACCAGGCUUCAUUCCCCAUGACUCGACCAUGG 


Fig.3 The methods of construct phylogenetic tree of miRNA gene 
The part in red is mature and star miRNA sequences, the part in green is the sequences of loop structure, the part in black is the sequences 
of prolonged stem. In step 1, the most conserved miRNA; MIRNA duplexes region is removed, the sequences of prolonged stems and loops 
are retained. Then, the sequence characters of AUCG were translated into structure characters. After that, the consensus sequences and struc- 
tures of prolonged stems and loops are identify. Next, the structure characters of prolonged stems and loops were translated into sequence char- 


acters based on the consensus sequences and structures again. At last, the removed sequences of miRNA and miRNA * are restored 


plications that involve one or two genes known as 
tandem duplications (Maher et al., 2006). The du- 
plication events also contribute to expansion of miR- 
NA families, especially in the WGD, some protein- 
coding genes and TE will generate new miRNA 
genes ( Hertel et al., 2006; Maher et al., 2006; 
Yuan et al., 2011). Previous studies showed that 
many plant M/RNAs are resided in the genomic du- 
plication blocks, the context sequencing of them and 
them duplication is conserved to their repetition 
(Sun et al., 2012). So the flanking sequences, in- 
cluding conserved coding genes and conserved non- 
coding regions flanking can be used for identification 
the MIRNA gene in duplicated blocks. 

Maher et al. (2006) have studied the evolution 
of duplication derived MIRNA165/166 in A. thaliana , 
in their research, the overall evolution of seven 
miR166 members were reconstructed. Recent re- 
search suggests ath-miR165a is origin from ath- 
miR166b before the origin of ath-miR166a, then a 
base substitution is introduced into ath-miR165a, in 
the recent large large-scale duplication event, ath- 
miR165b and ath-miR166a are originated from the 
duplicated blocks of ath-miR165a and ath-miR166b 


respectively. It’s in accordance with the results in 
phylogenetic research above, ath-miR165a and ath- 
miR166b belong to type 1 MIRIO6 gene, ath- 
miR165b and ath-miR166b belong to type 2 MIR166 
gene, it’s also a evidence for that the type 2 MIR166 
gene is originated from type 1 MIR166 gene. 

As a genome sequenced plant, the research of 
miRNA in P. patens is concerned. Though P. patens 
is a non-flowering plant, several conserved miRNAs 
exist in flowering plants were detected in moss. 
miR166 is the one of the miRNAs which is found in 
moss earliest, Floyd and Bowman (2004) cloned a 
HD-ZIP III gene homolog PpC3HDZIP1 which con- 
tains a conserved miR166-targerted region from 
P. patens, they also cloned a putative miR166- 
guided cleavage product of PpC3HDZIP1 mRNA. 
The experiment indirectly demonstrates the exists of 
miR166 in moss. In the next study, more and more 
moss miRNA were detected with the availability of 
P. patens genomes, including 12 MIRNA166 genes 
(Arazi et al., 2005). The numerous homologies in 
moss suggest that the expansion of miR166 is consid- 
erable. However, as outlined above, miR166 family 


is more conserved in P. patens than other plants with 
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many members, so we want to reveal the intraspecif- 
ic evolution of miR166 in P. patens. 

In this study, we analysed the context of 
MIRI166 genes in P. patens to seek their flanking 
coding genes and noncoding sequences, those se- 
quences were used to identify miR166 homology 
genes origined from duplication block. Then, a total 
of 9 duplication blocks around MIRI66 genes were 
obtained in P. patens (Table 2). There are 8 out of 
13 miR166 loci were located in miRNA clusters 
( ppt-miR166cdf cluster, ppt-miR166ij cluster, and 
ppt-miR166hkm cluster), the short length of three 
miRNA clusters indicate at last one tachytelic evolu- 
tion events of tandem duplications occurred in moss 
genome , ppt-miR166m of the base substitution is or- 
igin from ppt-miR166h or ppt-miR166k in a latter 
tandem duplications event. On the other hand, the 
clusters of miR166 suggest they provide stable and 
conservative regulatory effect in the P. patens. Then 
through the conserved flanking coding gene between 
MIRNA166 genes, we found that one flanking coding 
gene of miR166e is conserved to two flanking coding 
genes of the miR166ij cluster, then, the higher con- 
served stem-loop structure between miR166e and 
miR166j indicted that miR166j is origin from the du- 
plication block of miR166e before the tandem dupli- 
cations event. There are two conserved flanking cod- 
ing genes between ppt-miR166ij and ppt-miR166a, 
then, we found there are conserved noncoding se- 
quences between ppt-miR166a and ppt-miR166ij, 
so we inferred that ppt-miR166a is a segmental du- 
plication of ppt-miR166ij cluster before the loss e- 
vent near the ppt-miR166a. 

We also reconstruct the phylogenetic tree of 
miR166 gene in the P. patens (Figure 5). In the tree, 
we found that the tandem duplications and segmental 
duplications is frequently in the moss genome, it is 
most likely to occur a large large-scale duplication e- 
vent in the genome level, but we need more evi- 
dence. The gene loss of miR166 in the moss is fre- 
quent too, there are at least 3 loss events we known. 


There is a replication block between ppt-miR166g 
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Fig.4 The phylogenetic tree of type 1 premiR166 
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Table 2 The number of conserved protein-coding genes 


in the flanking the miR166 























Duplicated Duplicated flanking conserved 
miRNAs 1 miRNAs 2 protein-coding genes 
ppt-miR 166a ppt-miR 166ij 2 
ppt-miR166b pt-miR 166e 1 
ppt-miR166b ppt-miR166g 1 
ppt-miR166g pt-miR166jj 1 
ppt-miR166cdf ppt-miR 1661 2 
ppt-miR 166¢e pt-miR 166ij 2 
ppt-miR 166e ppt-miR 1661 1 
ppt-miR166e pt-miR 166g 1 
ppt-miR166jj ppt-miR166hkm 1 
loss 
miR166a 
miR166i 
miR166j 
miR166e 
eal 
miR166g 
miR166b 
miR1661 
miR166f 
miR166c 
miR166h 
loss 
? miR166h 
miR166k 
miR166m 
loss 


Fig.5 Reconstruction of miR166 evolution in Physcomitrella patens 
Trigon is represent a segmental duplications event, 


a circle is a tandem duplications event 


and miR166e, there also a replication block of ppt- 
miR166h between ppt-miR-166i or ppt-miR166l. 
However, the evolution details of ppt-miR166g and 
ppt-miR166h is still unknown. 
2.4 Phylogenetic distribution of miR166 targe- 
ted-HD-ZIP III in land plants 

Members of the Class II HOMEODOMAIN- 
LEUCINEZIPPER transcriptional factors are plant- 
specific and involved in many plant development 
processes ( Boualem et al., 2008; Zhu et al., 2011; 
Sakaguchi and Watanabe, 2012; Zhang et al., 
2012). The shoot apical meristem (SAM) fate is 
specified by HD-ZIP III transcriptional factors (Sak- 


aguchi and Watanabe, 2012). HD-ZIP III gene is 
regulated by miR166; AGO1, in this process, 
AGO10 combined with miR166 competitively to re- 
straint the negative regulation of HD-ZIP III gene 
and maintenance of undifferentiated stem cells in the 
SAM (Liu et al., 2009). HD-ZIP III gene and 
MIRI66 gene are present in many land plants because 
they make an important contribution to maintain the 
SAM and determine adaxial/abaxial polarity in plant 
tissues. Negative regulation of HD-ZIP III by miR166 
is highly conserved in land plants, and these HD-ZIP 
III functions show diversity among different plant 
groups (Sakaguchi and Watanabe, 2012). 

Twenty-two out of 35 HD-Zip IIT genes were the 
putative target genes for miR166 family with high 
probability (Additional file 5, http; //journal. kib. 
ac. cn/UserFiles/File/ZXY. rar). The targeted re- 
gion of HD-Zip III genes is very conserved in land 
plants. We also built the phylogenetic tree of the 
eight plants ( Figure 6). There are two clades of HD- 
Zip II genes, all the P. patens HD-Zip III genes be- 
long to clade 1, they shows considerable conserved 
and it may be the reason to the evolutional characters 
of ppt-miR166. HD-ZIP III protein is a key tran- 
scription factor, it seems to be conserved in the du- 
plication events, the more copy of HD-ZIP III is a 
factor of expand of MIR166 gene. And the stability 
function of HD-ZIP III restrict the conserved of 
MIR166 gene. 


3 Discussion 

The evolutionary rate around the plant is dispro- 
portion, it’s more slower in the functional sequences 
just as protein coding genes than the junk DNA as 
pseudogene. MIRNA gene is a “special case” , the 
overall evolutionary rate of MIRNA gene is approxi- 
mate to pseudogene although it is transcribed into 
mRNA-like RNA and generate a functional RNA 
production finally. The evolutionary rate of the differ- 
ent regions in MIRNA gene is vary drastically, the 
rate of mature and star miRNA is considerably lower 
than other regions, the conservative property of ma- 


ture sequences is higher than the miRNA” sequence 
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for the mature miRNA are essential to recognize tar- 
get mRNAs as well as to make a duplex structure 
with star sequences, whereas star sequences appear 
to be essential only for keeping the duplex structure 
with mature sequences ( Nozawa et al., 2010). 
Individual miRNAs are much more evolutionary 
labile, the expansion or loss is frequently, then, the 
incongruous conservative property between mature 
sequence and precursor sequence increase the diffi- 


culty to understand the overall evolutionary of those 


miRNA gene as MIR166 through the traditional 


methods. The structure character is useful to the evo- 
lution research in the MIRNA genes. 

However, the surrounding conserved coding 
genes sequences of MIRNA gene with normal evolu- 
tionary rates, the duplication events and evolutional 
of them could be traced. So the evolution of the con- 
served flanking protein-coding gene and non-coding 
sequences give a track to the evolutional history of 
miRNAs in the same duplication blocks. The arising 
of their homologues gene always be associated to the 


duplications events in the genome, it can contribute 
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Fig.6 Phylogenetic distribution of HD-ZIP III genes in land plants. The miR166 targeted- HD-ZIP III genes are signed in yellow 
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to the research of the evolution in a smaller lineage. 

In reconstructing of the evolution history of 
MIR166 genes, the structure characteristics of pre- 
miR166 were taken into account tentatively and we 
acquired believable results. We hope miRNAs could 


be used as the phylogeny markers in the future. 
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华南 植物 研究 所 早期 史 一 一 中 山大 学 农林 植物 研究 所 史 事 (1928-1954) ， 中 国 近世 生物 学 机 构 与 人 
WAB, WRIA, 上海 交 通 大 学 出 版 社 出 版 ，2013，216 W, ISBN: 978-7-03-039267-1. 

中 国 科 学 院 华南 植物 园 前 身 是 中 山大 学 农林 植物 研究 所 ，1928 年 由 著名 植物 学 家 陈 焕 久 在 中 山 
大 学 农 科 设 立 。 陈 焕 饼 先生 以 其 雄 才 博 学 ， 致 力 于 种 类 丰富 的 华南 地 区 植物 区 系 研究 ， 使 中 大 农林 植 
物 研究 所 与 胡 先 卫 主持 的 北平 静 生 生物 调查 所 植物 部 “双星 辉 映 ”， 成 为 民国 时 期 一 南 一 北 两 个 最 重 
要 的 植物 学 研究 机 构 。1954 年 ， 中 山大 学 农林 植物 研究 所 改 隶 中 国 科 学 院 ， 成 为 中 国 科学 院 华南 村 
物 研 究 所 〈 现 名 中 国 科学 院 华南 植物 园 ) 。 长 期 以 来 ， 学 界 对 中 山大 学 农林 植物 研究 所 ( 即 1954 年 
改 隶 中 国 科 学 院 之 前 ) 这 上 段 早 期 历史 ,未 曾 有 过 系统 梳理 ,许多 人 物 和 事迹 已 淹没 不 彰 。 本 书 以 档 
案 为 主要 材料 ， 记 述 中 山大 学 农林 植物 所 发 展 脉 络 ， 事 不 分 巨细 ， 凡 有 记录 ， 即 采 入 书 中 ， 从 细微 
处 ， 或 可 见 历史 真实 。 同 时 ， 陈 焕 饼 为 该 所 芮 基 人 ， 长 期 执掌 该 所 ， 
历 密 不 可 分 ， ee 、 第 一 章 : Peni git sae 
人 、 第 二 章 : 农林 植物 所 之 创建 与 发 展 、 第 三 章 : 播 迁 与 流亡 、 人 ， 战 后 复员 、 第 五 章 : 改 隶 中 
国 科学 院 、 大 事 记 和 参考 文献 。 


西双版纳 热带 植物 园 五 十 年 ， 胡 宗 刚 著 ， 科 学 出 版 社 出 版 ; 2014，370 H. ISBN: 978-7-03-039267-1. 

中 国 科 学 院 西 双 版 纳 热带 植物 园 由 著名 植物 学 家 蔡 希 陶 主 持 创 建 ， 经 过 五 十 年 发 展 , 已 成 为 中 国 
热带 植物 学 的 重要 研究 机 构 ， 也 是 西双版纳 重要 旅游 景区 。 本 书 以 该 园 所 藏 档案 为 主要 材料 ， 兼 而 采 
访 相 关 人 士 ， 以 科学 社会 学 方法 撰写 而 成 ， 完 整 再 现 该 园 五 十 年 历程 。 由 于 机 构 几 经 分 合 改 隶 ， 本 书 
ea hg gh ge dg te 中 国 科学 院 昆 明生 态 研 究 所 等 ， 
所 涉及 主要 人 物 除 袭 硕 陶 外 ， 还 有 许 再 富 、 裴 盛 基 、 汉 耀 宗 、 陈 进 等 。 书 中 记录 了 大 量 著 名 学 者 和 知 
名 人 士 与 该 园 的 相关 活动 ， 照片 多 为 首次 公布 。 全 书 共 分 为 六 章 : 创 
建始 末 、 建 园 初 期 、 文 化 大 革命 十 年 、 改 革 之 中 求 发 展 、 分 途 发 展 、 重 新 整合 。 书 末 附 有 结语 、 编 年 
纪事 、 主 要 参考 文献 、 人 名 索引 和 后 记 。 

胡 宗 刚 先生 不 愧 为 中 国 植物 学 史 研 究 之 骄子 ! 看 过 他 过 去 十 年 发 表 的 一 系列 有 关 著 作 : 不 该 遗忘 
HHE (长 江 文 艺 出 版 社 , 2005) 、 静 生生 物 调查 所 史 稿 (山东 教育 出 版 社 , 2005) 、 胡 先 聘 先生 年 
谱 长 编 (江西 教育 出 版 社 , 2008) 、 王 文采 口述 自传 (湖南 教育 出 版 社 , 2008) 、 庐 山 植 物 园 最 初 三 十 
年 (上海 交 通 大 学 出 版 社 , 2009) 、 北 平 研究 院 植 物 学 研究 所 史 略 ( 上海 交 通 大 学 出 版 社 , 2010) ， 真 
的 让 人 赞叹 不 已 ! 不 要 说 自学 成 才 了 ， 就 是 我 等 专业 人 士 也 自 愧 不 如 。 
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